-
Notifications
You must be signed in to change notification settings - Fork 572
[fbgemm_gpu] Remove sm_100
and sm_120
#4024
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
q10
commented
Apr 25, 2025
•
edited
Loading
edited
- Remove sm_100 and sm_120 from architectures list and keep just sm_100a and sm_120a instead, to enable compilation for FP4 CUTLASS quantization kernels (Enable FP4 CUTLASS GEMM and CUDA quantization kernels #4004), since we are running into the following error:
✅ Deploy Preview for pytorch-fbgemm-docs ready!
To edit notification comments on pull requests, go to your Netlify site configuration. |
f7a44b8
to
fdfbe68
Compare
sm_100
and sm_120
@q10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
1 similar comment
@q10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
Summary: X-link: facebookresearch/FBGEMM#1133 - Remove sm_100 and sm_120 from architectures list and keep just sm_100a and sm_120a instead, to enable compilation for FP4 CUTLASS quantization kernels (pytorch#4004), since we are running into the following error: ``` Instruction 'cvt with .e2m1x2' not supported on .target 'sm_100' ``` Reviewed By: spcyppt Differential Revision: D73901832 Pulled By: q10
fdfbe68
to
12eea6a
Compare
This pull request was exported from Phabricator. Differential Revision: D73901832 |
Summary: X-link: facebookresearch/FBGEMM#1133 - Remove sm_100 and sm_120 from architectures list and keep just sm_100a and sm_120a instead, to enable compilation for FP4 CUTLASS quantization kernels (pytorch#4004), since we are running into the following error: ``` Instruction 'cvt with .e2m1x2' not supported on .target 'sm_100' ``` Reviewed By: spcyppt Differential Revision: D73901832 Pulled By: q10
12eea6a
to
35984fd
Compare
This pull request was exported from Phabricator. Differential Revision: D73901832 |
Summary: X-link: https://github.com/facebookresearch/FBGEMM/pull/1133 - Remove sm_100 and sm_120 from architectures list and keep just sm_100a and sm_120a instead, to enable compilation for FP4 CUTLASS quantization kernels (pytorch#4004), since we are running into the following error: ``` Instruction 'cvt with .e2m1x2' not supported on .target 'sm_100' ``` Pull Request resolved: pytorch#4024 Reviewed By: spcyppt Differential Revision: D73901832 Pulled By: q10 fbshipit-source-id: 690c58b214aee80374e43a93bf39fe70e430da9a